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It is shown that the excellent Murchison Radio-astronomy Observatory site allows the Murchison 
Widefield Array to employ a simple RFI blanking scheme and still calibrate visibilities and form 
images in the FM radio band. The techniques described are running autonomously in our cali- 
bration and imaging software, which is currently being used to process an FM-band survey of the 
00 entire southern sky. 
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1. Introduction 

The Murchison Widefield Array (MWA) is both a pathfinder to the Square Kilometre Array 
and a stand-alone instrument, designed for cutting-edge science between 80 and 300 MHz via 
real-time imaging. This frequency band is rich with scientific potential, key science drivers being 
detection of redshifted 21 cm emission from neutral Hydrogen during the Epoch of Reionization, 
detection of radio transients, and space weather monitoring. However, the band can be difficult to 
observe in due to high levels of RFI. To reduce the need for complicated interference mitigation 
strategies, a remote site has been chosen so that RFI-free parts of the spectrum are available across 
the entire frequency range. While the 1024-input correlator and high-throughput software backend 
make the MWA a state-of-the-art instrument for advanced cancellation techniques, simple blanking 
methods have the advantage of simple statistics and are well suited to the low spectral and temporal 
occupancy at the site. Maintaining simple statistics is particularly important in the FM radio band, 
where it is anticipated that part of the weak neutral Hydrogen signal emitted during the Epoch of 
Reionization lies. 

In addition to the excellent site, several design features help the MWA confront other chal- 
lenges faced by modern and future radio facilities. For example, restricting the maximum baseline 
to a few kilometres reduces ionospheric distortion models to two dimensions, and real-time snap- 
shot imaging allows the removal of these and various wide-field distortions in the image plane. 
Real-time processing also enables the large visibility data rate that comes with 512 antennas. A 
consequence of real-time processing and high data rates is that RFI algorithms need to be robust 
and efficient. A description of one such approach follows. 

The Murchison Radio-astronomy Observatory 

The Murchison Radio-astronomy Observatory (MRO) is a protected site in remote Western 
Australia that has been set aside as a park for a number of next-generation radio facilities. It consists 
of a zone, with a radius of 100 to 150 km, in which radio emission is restricted. While this level of 
protection is imperative for detection of extremely weak target signals, terrestrial radio waves can 
travel large distances via reflections off aircraft or meteor trails, or by occasional refractive ducting, 
while signals from satellites and aircraft can be in the direct line of sight. 

FM radio signals from communities outside the restricted zone are an example of signals that 
will at times be reflected into the MRO. However, removing a modest amount of contaminated data 
will preserve this important band. A neighbouring instrument, the Experiment to Detect the Global 
Epoch of Reionization Signature (EDGES), removed all detected FM interference by filtering 3 
percent of their 130 MHz band for up to 15 percent of the time during a typical day, with statistics 
collected over approximately 80 days [1]. 

2. The Real-Time Calibration and Imaging System 

The MWA will consist of 512 antenna tiles, each a 4x4 array of dual-polarised, vertical bowtie 
dipoles. It is currently in a final design phase, with the first 32 antennas - along with prototype 
beamformers, receivers, correlator boards and computing hardware - being regularly used in an 



2 



The Murchison Widefie Id Array 



Daniel A. Mitchell 



end-to-end pipeline. The primary aim of the 32-tile system is to test hardware and firmware before 
the build out of the full array, and to aid the design of real-time software systems. 

With the challenging dynamic range goal of a million to one in mind, a comprehensive, wide- 
field, real-time calibration and imaging system has been designed (see [2] and [3]). While the 
bulk of the real-time system is running routinely on simulated data using clusters with substantial 
graphics-card acceleration [4], for the 32-tile array the design has been significantly simplified. In 
keeping with the series of papers that accompany [5], and if the visibilities are dominated by a 
strong, unresolved, calibrator source, one can write the measured coherency matrix (containing the 
four polarisation terms of the visibility), after passing through antennas j and k, as 

Y Jk = Jj (Sexp{-^» jj + N#, (2.1) 

where a dagger superscript denotes a conjugate transpose, 0^ is the phase shift due to differences in 
the path length from the source to the two antennas, and each bold symbol in (2.1) represents a 2 x 2 
matrix; S is the coherency matrix of the calibrator signal, the Jones matrices, J ; and J^, describe 
the response of the antennas, and Njk contains the covariance of the polarised components of the 
noise. This noise comes mainly from the receivers and from other radio sources and, in the absence 
of RFI and significant secondary sources, we might expect it to have a Gaussian distribution. This 
is discussed further in section 3. 

With a model for the source, Sexp{— i(j)jk}, and noise with an approximately Gaussian distri- 
bution, we can estimate the Jones matrix for each of the M antennas towards the calibrator using 
a least- squares approach (any amplitude or phase errors in the source model will be absorbed into 
these estimates). Following the matrix formalism suggested in [6] and described in [2], we search 
for the matrices Jj that minimise 
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(2.2) 



where, for arbitrary matrix A, the squared Frobenius norm 1 1 A 1 1 \ = Tr ( A A 1 " ) , and aj k = ( 1 1 1 1 2 F ) . 
The solutions of (2.2), 

Jj=(t G Jk exp{+i^}) %A ( £ a/s J{ %S*) , (2.3) 
VWj / VWj / 

can be estimated from an initial set of Jones matrices and iterated until they converge to an optimal 
set, which can then be tracked. 

The MWA is a snapshot instrument that uses (2.1) and (2.3) to produce calibrated images every 
eight seconds for each frequency channel and polarisation. 1 The images are regridded in real time 
to remove ionospheric and wide-field distortions, then integrated for several minutes. Eventually 
(2.3) will be updated for many calibrators, to build up direction-dependent gain models, but for the 
32-tile array it is assumed that the shape of the primary beam response of each antenna is known, 
so that the models can be fixed by a single calibrator. It is also assumed that the primary beam 
shapes are the same for each antenna, so that the fixed beams are equal. These issues are described 



1 A separate algorithm is used to estimate deviations in (j)^ arising from the ionosphere. 
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in more detail in [7], where the real-time system was applied to the 32-tile array to track fields and 
integrate for many hours over much of the band from 90 to 210 MHz. 

2.1 Flagging in the Real-Time System 

With all of the dimensions that can be used to describe a typical observation - time, frequency, 
polarisation, antenna space, visibility space, image space, etc. - RFI mitigation will in general be 
most effective if one can find a projection in which the interference has little or no overlap with 
the signals of interest. For RFI that is contained within small segments of time and frequency, 
these two dimensions are obvious places to define the segmentation, and traditional techniques for 
blanking samples in time and frequency offer close to complete removal of the RFI at the expense 
of losing a small amount of the signal of interest. 

The calibration and imaging system has a separate processing pipeline for each frequency 
channel, and this coupled with the real-time nature of the instrument makes it well suited to time 
and frequency blanking. While there are a number of places to set flags that can be used throughout 
the system to blank visibilities, it is only being done at the moment at the start of the software 
system. There are two detectors in use; one generates statistics for each frequency channel using 
neighbouring time samples, the other generates statistics for each time step using neighbouring 
frequency samples. Both detectors search for outliers using quartiles, where departures from the 
median value by more than a set multiple of the interquartile range are flagged for blanking. The 
interquartile range is a robust estimator of signal variance and is relatively cheap to compute (see, 
for example, [8]). Each visibility has an associated non-negative inverse-variance weight, and flags 
are set by making these negative. The calibration and imaging routines ignore samples that have a 
negative weight, and accumulated quantities that are missing samples due to flagging can simply 
be down weighted. 

Development is underway to combine these detectors with goodness-of-fit estimates for the 
relations given at the start of this section. Also, since images are formed every eight seconds, 
various image processing tools can be run before integration, as a final check of data quality. 

3. 32-Tile Results 

The accuracy of calibration relations (2.1) and (2.3) will in general be diminished by any RFI 
in the coherency matrices. In this section some examples will be given showing the effect of both 
RFI and blanking. Data come from 21 of the 32 antennas and 6.2 MHz of bandwidth centred at 
98 MHz. The antenna tiles have been steered to a field with a right ascension of 21.2 hours and a 
declination of -25.4 degrees, close to transit. The dominant object in the field is extragalactic radio 
source PKS J2 107-2526, which is 61 Jy at 80 MHz [9]. This is one of the calibration pointings 
from a survey of the entire southern sky, which is currently under construction. 

In the absence of RFI, the validity of the model given in (2.1) may be verified by inspecting 
noise matrices. The matrices that minimise (2.2) can be used to remove the calibrator power from 
the measured coherency matrix (known as peeling [10]), and if the assumptions hold then the 
remaining noise term should have a Gaussian distribution. The noise distribution after peeling is 
shown in figure 1. In this paper the fundamental accuracy of the calibration process is not being 
considered, and it is adequate to note that the noise distributions qualitatively match Gaussians. 
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Figure 1: Histograms of the noise in each element of the co- 
herency matrices - estimated by peeling the fitted source con- 
tribution - along with best- fit Gaussian curves. The peeling 
was done separately in each 40 kHz channel, and the statistics 
generated after combining the whole band (after dividing out 
a polynomial bandpass fit). Several histograms were averaged 
to reduce the scatter. 



In figure 2 the amplitude of Jones matrix element //(l, 1) is shown as a function of time and 
frequency for one of the MWA antennas. To avoid known artifacts of the prototype receiving sys- 
tem, specific frequency channels were always blanked for these data, which can be seen as vertical 
bands of zero gain. About halfway though the scan RFI becomes evident and increases in power 
as the scan continues (the gain between FM radio frequencies is also affected by contamination of 
the bandpass fits). 

It is clear that blanking has significantly reduced the RFI in these data. Jones matrix errors 
have been reduced by 30 dB, leaving gains that are stable in both frequency and time. During the 
worst of the RFI only about 12 percent, or ^0.76 MHz of bandwidth, was blanked. The quality 
of the unblanked data can be seen in figure 3, where images made with and without blanking are 
compared. The calibrator is completely buried in the presence of RFI, but can be imaged and peeled 
below the noise floor with the RFI flags. 
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Figure 2: Jones matrix element //(l, 1) is shown as a function of time and frequency for MWA antenna 
tile 1, with dashed vertical lines indicating possible FM radio frequencies. The image on the left-hand side 
shows fitted gain amplitude without RFI blanking, while the image on the right-hand side shows the gain 
amplitude after all detected outliers were blanked. Blanking has maintained the smooth gain structure into 
the time interval that contains the strong RFI, and the entire time interval can be accurately calibrated and 
imaged. 



4. Conclusion 

The MWA employs a real-time calibration and imaging system to cope with a large data rate 
and fast-changing calibration parameters. While the MWA is situated in the radio-quiet Murchison 
Radio-astronomy Observatory, it will observe in frequency bands that are occupied by numerous 
communication signals. While these signals are transmitted from distant locations, they will, on 



5 



The Murchison Widefie Id Array 



Daniel A. Mitchell 




Right ascension (hours) Right Ascension (hours) 



-400 -300 -200 -100 100 200 300 400 -15 -10 -5 5 10 15 

Jy/bea m Jy/bea m 

Figure 3: On the left is an image of the J2 107-2526 field produced by integrating 8-second snapshots over 
the entire time interval without blanking. On the right is an image of the field after RFI blanking and peeling, 
along with contours of the unpeeled image. 

occasion, reflect or refract into the receivers at levels that are orders of magnitude above the noise 
floor. During deep integrations the MWA real-time system will simply discard dubious data. This 
will require a series of data-quality tests, of which the simple median-based detector shown here 
will form an integral part. 
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